Learning to Focus Attention on Discriminative Regions for Object Detection
نویسندگان
چکیده
A major task of visual attention is to focus processing on regions of interest to enable rapid and robust object search. Instead of integrating generic feature extraction into object specific interpretation we strictly pursue a top-down approach. Early features are tuned to selectively respond to task related visual features, i.e., locally discriminative information that is useful in object recognition. In this work we determine discriminative regions from the information content in the local appearance patterns. A rapid mapping from appearances to discriminative regions is estimated using decision trees. The focus of attention on discriminative patterns enables then efficient detection of a searched object, but also the definition of sparse object representations to respond only to task relevant information. In the experiments, the performance in object recognition from single imagettes dramatically increased considering only discriminative patterns. Evaluation of complete image analysis under various degrees of partial occlusion and image noise resulted in highly robust recognition even in the presence of severe occlusion and noise effects. Finally, preliminary results on attentive object detection in cluttered environments demonstrated successful indexing to relevant locations.
منابع مشابه
Attentive Object Detection Using an Information Theoretic Saliency Measure
A major goal of selective attention is to focus processing on relevant information to enable rapid and robust task performance. For the example of attentive visual object recognition, we investigate here the impact of top-down information onmulti-stage processing, instead of integrating generic visual feature extraction into object specific interpretation.We discriminate between generic and spe...
متن کاملFast Fine-grained Image Classification via Weakly Supervised Discriminative Localization
Fine-grained image classification is to recognize hundreds of subcategories in each basic-level category. Existing methods employ discriminative localization to find the key distinctions among similar subcategories. However, existing methods generally have two limitations: (1) Discriminative localization relies on region proposal methods to hypothesize the locations of discriminative regions, w...
متن کاملVisual-textual Attention Driven Fine-grained Representation Learning
Fine-grained image classification is to recognize hundreds of subcategories belonging to the same basic-level category, which is a highly challenging task due to the quite subtle visual distinctions among similar subcategories. Most existing methods generally learn part detectors to discover discriminative regions for better classification accuracy. However, not all localized parts are benefici...
متن کاملTriplet-Center Loss for Multi-View 3D Object Retrieval
Most existing 3D object recognition algorithms focus on leveraging the strong discriminative power of deep learning models with softmax loss for the classification of 3D data, while learning discriminative features with deep metric learning for 3D object retrieval is more or less neglected. In the paper, we study variants of deep metric learning losses for 3D object retrieval, which did not rec...
متن کاملObject Detection in Real Images
Object detection and recognition are important problems in computer vision. Since these problems are meta-heuristic, despite a lot of research, practically usable, intelligent, real-time, and dynamic object detection/recognition methods are still unavailable. We propose a new object detection/recognition method, which improves over the existing methods in every stage of the object detection/rec...
متن کامل